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FOREWORD 

The  r>"  '  mis  report  is  to  describe  results  of  studies  in  sound 

Ipr'1’:.'  earned  out  over  a  3-ycar  period.  During  this  time  trie  role 

•'  uic  external  ear  in  sound  localization  was  established  and  devices  were 
constructed  to  provide  means  for  localizing  sounds  in  other  environments. 

This  work  was  conducted  by  United  :’f*search,  Inc.,  Cambridge,  Mass., 
under  Contract  No.  N123(6P530)  32279A  issn  d  by  the  U.S.  Naval  Ordnance 
Test  Station.  The  information  he  rein  covers  wo  ’•  that  completed  this  con¬ 
tract  in  December  1963. 

This  is  Part  4  (Final  Report)  of  a  series  of  reports  that  •• r ■  issued 

covering  various  aspects  of  the  subject.  The  titles  of  Parts  1,  2,  ,n.  i  ’ 
arc  as  ioliows: 

Part  1,  Characteristics  of  Human  Localization  of 
Sound  (December  1962) 

Pa  it  2.  The  Mechanism  of  Human  Localization  of 
Sounds  With  Applications  in  Remote  Environments 
(Deci.  nl, ,  .  1962) 

Part  3.  A  New  Tneoi  of  Human  Audition  (December 
1963) 

The  text  pages  in  this  report  were  reproduced  m  •  •d".iite  from  a 
report  issued  by  the  contractor. 

WM.  B.  MCLEAN 
Technical  Director 
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CHAPTER  1 
SUMMARY 


1 .  I  Introduction 

The  work  reported  here  was  done  under  U.  S.  Naval  Ordnance 
Test  Station  Contract  No.  N123-(6QS3C)  32279A  to  continue  research  in 
human  localization  of  sound  begun  under  earlier  contracts  and  reported 
in  references  1  .  2  and  3. 

tally  attempts  to  reproduce  me  i unction  ot  human  localization 
by  electronic  means  both  in  air  and  underwater  disclosed  the  need  for 
advances  in  the  state-of-the-art  in  microphones ,  headphones,  hydrophones, 
and  a  greater  knowledge  of  the  effects  of  mechanical  structure  between 
the  ears.  A  study  was  undertaken  to  resolve  these  problem  areas,  which 
were  divided  into  four  prime  tasks.  ( i >  To  study  improvements  in 
hydrophones  for  adaptation  to  underwater  listening  systems  utilizing 
ear  ,'eulicas;  (2)  to  study  the  effects  of  mechanical  structure  between 
the  ears  on  localization;  (3)  to  study  lmpiovements  In  headphones  for 
use  In  localization  systems;  (4)  to  continue  the  study  of  the  transiormation 
function  of  the  pinna. 

Two  additional  tasks  were  included  to  develop  devices  for  use 
in  man-to-porpoise  and  porpoise  to- man  communication.  The  design 
concept  for  these  translators  was  an  outgrowth  of  localization  research 
which  provided  new  insight  into  the  recognition  of  human  speech. 


1 . 2  Results 
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Headphones,  on  the  other  hand,  were  made  which  accurately  transferred 
the  localization  function.  The  study  of  the  effects  of  structure  showed 
the  independence  of  interaural  distance  and  tne  localization  function. 
Significant  progress  was  realized  in  the  theoretical  aspects  of  our  study 
wherein  consistent  theorems  for  finding  the  inverse  transformation  have 
been  formulated  and  the  processing  schematics  dravrn  which  closely 
parallel  the  known  structure  of  neurai  networks 

It  has  been  established  that  the  porpoise  communicates  in  his 
environment  by  modulated  whistles.  Two  translators  were  made  .  the  first 
of  which  translates  porpoise  whistles  to  human-like  sounds  while  the 
second  translates  in  the  other  direction.  Two-way  communication  was 
thus  provided.  In  <'xpefimfcnts  at  Point  Mugu  we  succeeded  in  getting 
the  porpoise  to  mimic  certain  sounds. 


CHAPTER  2 


EXPERIMENTAL  DEVELOPMENTS 


2.  1  Underwater  Tests 

A  series  of  tests  were  conducted  with  the  underwater  listening 
system  shown  in  Figures  2.1-1,  2.1-2  and  2.1-3.  The  ears  are  cast 
from  stainless  steel  and  are  4.5  times  norma!  size.  The  spherical 
covering  is  3/8"  thick  Absonic  A.  Clevite  CH-12  Oyster  hydrophones 
were  used  in  this  system.  Its  response  is  shown  in  Figure  2.  1-4. 

The  first  series  of  tests  were  conducted  at  Monis  Dam  Test 
Facility  and  were  designed  to  ascertain  the  navigational  utility  of 
aural  coupling  to  water.  The  listening  system  was  suspended  3C  feet 
below  a  navigable  barge.  The  subject  was  seated  on  the  deck  and 
covered  so  that  no  visual  references  were  possible.  A  noise  source 
shown  in  Figure  2. 1-5  was  lowered  to  various  depths  at  distances  up 
to  500  feet  away.  The  clapper  was  struck  by  pulling  on  a  line 
attached  to  it.  The  subject  listening  through  the  pickup  indicated 
the  direction  the  operator  should  proceed.  With  the  sound  source 
fixed,  the  barge  was  navigated  to  it  every  lime  using  only  aural 
information . 

The  experiment  was  conducted  as  follows.  The  sound  source 
was  moved  to  an  unknown  location  by  outboard  motor  boat  and  lowered. 
The  subject  was  isolated  during  this  time.  When  the  sound  began., 
the  subject,  now  listening  through  the  underwater  pickup.  Indicated 
the  direction  the  barge  operator  was  to  go.  As  the  barge  turned  the 
sound  was  perceived  to  move  to  the  front.  Straight  ahead  direction 
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Figure  2.1-2  Enlarged  Ear  Replica  in  Underwater 
Listening  System 
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Figure  2.1-3  Experimental  Underwater  Listening  System 
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Figure  2.  1-4  Response  of  Oyster 


was  then  given  and  small  corrections  thereafter.  It  is  significant  to  point 
out  that  no  subject  made  gross  errors  in  direction  which  he  iater  corrected. 

The  second  series  of  tests  were  conducted  in  January  1963  off 
San  Clemente  Island.  These  tests  were  designed  to  provide  information 
or  data  on  system  performance  in  the  ocean  environment.  The  ear 
system  was  lowered  over  the  stem  of  a  Torpedo  Recovery  Boat  to  a 
depth  of  fifty  feet.  The  first  day  was  spent  near  shore  where  marine  life 
noise  was  very  evident,  snaps,  crackles,  and  pops  apparently  made 
by  crustaceans.  We  also  listened  to  small  craft  and  noises  made  by 
divers  hammering  on  buoys  and  on  a  barge  bottom.  localization  was 
equivalent  to  the  expectation  for  the  system 

Moving  out  to  sfcout  two  miles  left  ine  areas  of  crackling 
noise.  Again,  with  the  ears  at  50  feet,  the  sea  state  (about  one 
half)  was  audible,  as  was  a  sonar  of  about  9  kc,  source  unknown. 
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Figure  2. 1-5  Sound  Sou rep  Used  in 

Navigational  Utility  Tests 
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On  the  second  day  we  set  out  into  deep  water  about  six  miles 
from  San  Clemente.  Enroute  a  school  of  migrating  pilot  whales 
was  encountered  and  their  whistles  and  ranging  sounds  recorded. 

Upon  reaching  deep  water  the  listening  system  was  again  lowered 
to  a  depth  of  50  fee  t,  Sonar  signals  porpoise  and  whale  sounds  as 
well  as  a  sound  of  arc  welding  were  heard. 

On  the  third  day  the  listening  system  was  installed  in  the 
porpoise  pool  at  Pacific  Ocean  Park,  Santa  Monica.  Unfortunately, 
the  animals  were  very  quiet  this  day. 

Finally  more  tests  were  made  in  the  Long  Beach  area 
The  ambient  sea  noise  level  was  very  high  and  although  localization 
was  satisfactory,  range  was  very  limited 

The  results  of  these  tests  established  the  possibility  and 
the  utility  of  binaurai  coupling  to  the  undprwater  pnvimnmpnt. 

Effort  was  then  directed  toward  improvements  in  the  system  components. 

2  .  2  Hvd ronhones 

The  problem  of  selecting  a  hydrophone  for  adaptation  to 
underwater  listening  systems  prompted  original  efforts  in  hydrophone 
design  o  incorporate  the  features  known  to  be  necessary  for 
adequate  aural  coupling.  These  requirements  are  (1)  Frequency 
response  in  free  field  from  40  cycles  per  second  to  70,000  cycles 
per  second  within  +  2  db;  (2)  detection  threshold  5  db  below  sea 
state  zero  to  10  000  cycles;  (this  implies  high  element  sensitivity, 
approximately  -85  to  -90  db  re  lv/iibar  and  a  correspondingly  low 
preamplifier  noise  level,  -140  to  -145  db);  (3)  configuration  and 
size  so  as  to  span  the  entrance  of  the  ear  canal;  (4)  capable  of 
operation  to  5000’  depth. 
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in  a  survey  of  available  commercial  hydrophones  no  ops 
unit  was  found  to  meet  all  these  requirements.  A  compromise  was 
made  to  use  Clevite  CH-12,  Type  O  hydrophones  in  the  initial  under¬ 
water  listening  system  primarily  because  of  its  physical  size  and 
cost.  With  allowance  for  limited  hydrophone  characteristics,  the 
underwater  listening  system  worked  as  expected,  as  reported  above. 
However,  improvements  were  desirable.  To  this  end,  a  program  to 
develop  a  hydrophone  particularly  for  mounting  behind  an  ear  was 
undertaken.  It  was  decided  to  use  capacitance  change  techniques. 

It  seemed  possible  to  design  a  hydrophone  along  the  lines 
of  a  condenser  microphone  by  using  plastic  dielectric  material, 
in  order  to  obtain  an  effective  high  potential  while  using  a  low 
supply  voltage,  a  scheme  of  parallel  charging,  series  discharging 
was  designed.  This  is  illustrated  In  Figure  2.2-1. 


Figure  2.2-1  Parallel  Charge  —  Series  Discharge 
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The  signal  voltage,  e  is  given 


®12  +  e23  +  e34  +  e4*5 


-  «12c;1  +  «J30j\+  «.)°3  *  W'i1 


(2.2-1) 


It  is  assumed  that  all  are  equal  in  magnitude  and  of  alternate  sign, 


so 


(c"1  -  c:1  +  c:1  -  c:1) 


es  -  q  "■'!  ~2  ■  ~3  '’4 


(2.2-2) 


It  is  also  assumed  that  static  capacitance  ar  •  all  equal,  so  that  the 
d.c.  signal  is  zero.  The  change  in  signal  is  derived  from  changes  in 
capacitance  due  to  pressure 

de  n 

*  -  <■  2  (-'I*  ft  <°1’>  12.2-3) 

1  =  0 

The  basic  capacitance  is  given  by 


r* 

kA 

V—1 

O 

d 

0 

k 

=  dielectric  constant 

A 

=  area 

d 

0 

=  static  separation 

Under  pressure,  the  true  spacing,  d,  is  given  >f 

d 

"  do(1"a> 

P 

=  pressure 

o 

=  modulus  of  elasticity 

(2.2-4) 
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Thus  the  change  in  inverse  capacitance  with  time  can  be  written 


dC 

dt 


d.  lo  (1_P 

dt  kA  '  a  1 


o  dP 
akA  dt 


(2.2-6) 


'.  noosing  different  cr  materials  for  alternate  capacitors  permits  us  to  write 
de  nq  d 


dt 


- -  !— 

2  kA  O 


1 


J_.  dP 
°2  dt 


(2.2-7) 


i.ius  gaining  a  larger  signal  voltage  by  increasing  the  number  of  elements, 
l-Vperimental  materials  chosen  were  mylar  and  polypropylene  for  which 


mylar 


300  x  10  psi 


1.5  x  10  psi 


polypropylene 

Equation  (2.  2-7)  may  bo  rewritten  in  terms  of  the  polarization  voitage: 


de 
_ s 

dt 


—  v  r  ~ —  —  — - 1  — 

2  o  L  a1  1  dt 


(2.2-8) 


since 


kA/d 


Se..  iitivity  is  defined  as 


S  * 


de 
_ s 

dP 


Hep.  e 


(2.  2-9) 
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For  a  100  element  sensor  and  V  =  '’hits 

o 

S  =  6  volts/psl  =  100  n  volts/ /x  bar  =  -30  db  re  1  v/jubar 

Certain  difficulties  are  inherent  in  making  this  hydrophone. 
The  charging  resistance  R  in  Figure  2.  2-1  must  be  approximately 
1000  megohm  and  be  achievable  in  a  film.  Figure  2.2-2  shows  a 
typical  element  and  the  techniques  to  be  used  in  providing  electrical 
continuity. 


1  mil  mylar  or  polypropylene 


Figure  2.2-2  Typical  Flei  mt 

The  complete  sensor  is  made  by  alternately  stacking  mylar  and 
polypropylene . 

In  spite  of  the  promising  features  of  this  design,  construction 
had  to  be  abandoned  in  favor  of  a  more  conventional  technique 
when  details  of  fabrication  and  assembly  proved  too  difficult  to 
overcome . 

An  experimental  hydrophone  shown  in  Figure  2,2-3  was  tested 
by  U.  S.  Navy  Electronics  Laboratory.  Figure  2.2-4  is  the  frequency 
response  characteristic.  The  element  used  in  this  hydrophone  was 
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Figure  2.  2-4 
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constructed  as  shown  in  Figure  2.2-5.  From  the  response  curve 
it  was  concluded  that  the  mass  of  the  fiber  glass  to  a  large  extent 
limited  the  high  frequency  end.  A  second  element  was  made  as 
shown  in  Figure  2.2-6.  It  was  installed  in  the  steel  flange  to  which 
the  ear  was  fastened,  Figure  2.  2-7.  Unfortunately,  no  adequate 
test  data  was  acquired  due  to  breakdown  in  test  facilities.  Arrange¬ 
ments  were  made  to  use  facilities  at  U.  S.  Sonics  Corporation, 
Cambridge  but  it  was  found  that  the  techniques  and  equipment 
used  there  were  inadequate  for  broad  band  testing  required.  A 
binaural  listening  system  was  nevertheless  constructed  using  these 
hydrophones  and  tested  in  a  nearby  lake.  The  performance  of  these 
hydrophones  was  unsatisfactory  due  in  part  to  the  rapid  decrease  in 
sensitivity  with  depth.  At  fifty  feet  no  audible  signal  was  heard  despite 
the  presence  of  a  scuba  diver  in  the  vicinity  at  the  same  depth  knocking 
rocks  together.  Further  development  of  condenser  hydrophones  was 
subsequently  abandoned. 

Later  hydrophone  work  with  the  Massa  TR-14A,  the  Clevite 
CH-13  and  Chesapeake  TP-120  revealed  an  anomaly  in  observations 
made  in  earlier  work.  It  was  found  and  reported  earlier  that  the  output 
of  the  broad  band,  secondary  standard  TP  120  hydrophone,  when  brought 
close  to  polyurethane  foam,  decrease  appreciably.  Similarly,  when 
it  was  enclosed  behind  the  ear  canal  opening ,  the  behavior  was  the 
same.  These  effects  were  not  observed  with  the  CH-13  or  the  Massa 
TR-14A.  A  known  difference  between  these  hydrophones  was  the  type 
of  piezoelectric  crystal  used.  The  TP- 120  utilizes  lithium  sulfate, 
sensitive  to  hydrostatic  pressure  fields  while  the  others  use  barium 
titanate,  sensitive  to  directed  acoustic  pressure.  However,  after 
repair  and  recalibration  of  the  TP-120  necessitated  by  damage  incurred 
in  transit,  it  was  found  that  it  no  longer  behaved  differently  than  the 
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Figure  2.2-5  First  Experimental  Hydrophone  Element 


Figure  2.2-6  Can— Type  Hydrophone  Element 
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other  hydrophones.  In  addition ,  it.  was  further  observed  that  the 
responses  of  all  three  hydrophones  were  now  similar  whether  the 
ear  behind  which  they  were  placed  was  made  of  polyurethane  foam 
or  solid  stainless  steel.  This  disconcerting  behavior  is  as  yet 
unexplained.  However,  it  has  emphatically  indicated  that  the  ease 
of  adapting  microphones  to  ears  for  air  localization  systems  is  not 
paralleled  in  the  water  environment.  Despite  these  negative  results, 
the  data  now  on  hand  will  allow  greater  progress  to  be  made  in 
providing  effective  aural  coupling  to  water. 

2. 3  Structures 

The  diffraction  and  reflection  effects  Df  any  structure  within 
the  lnteraural  distance  has  frequently  been  assessed  a  high  value  of 
importance  in  localization.  Although  it  has  been  stated  that  the  mental 
function  utilizes  all  possible  information  in  localization,  we  were 
interested  to  know  if  some  of  these  clues  might  be  eliminated  without 
impairing  localization  ability.  To  this  end  three  binaural  pickups  were 
made  as  shown  in  Figures  2.3-1,  2.3-2  and  2.3-3.  In  order,  John  II 
maintains  the  features  of  a  human  head,  although  lacking  hair- like  material, 
the  significance  of  which  is  unknown;  Coco,  a  featureless  head,  and 
Peter,  no  head. 

In  each  case  the  ears  were  separated  by  the  equivalent 
acoustic  distance,  10-3/4  inches.  Subjective  responses  Indicated 
that  in  all  cases  the  ability  to  localize  was  neither  impaired  nor 
aided  by  the  lack  or  presence  of  structure  between  the  ears!  This 
was  a  gratifying  result  and  was  pursued  to  certain  extremes.  The 
ears  on  Peter  (no  head  at  all!  )  were  moved  close  together  until 
separated  by  four  inches  then  moved  out  until  separated  by  34  inches. 
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Figure  2.  3-2  Coco 
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In  both  cases  localization  was  unaffected  if  the  sound  source 
remained  outside  the  spherical  space  whose  diameter  is  defined 
by  the  lnteraural  distance.  Thus  it  appears  that  localization 
depends  more  on  autocorrelation  of  the  pinna  transformed  sound  than 
on  the  cross  correlation  between  ears,  which  has  been  found  to 
be  independent  of  interaural  distance. 

Effective  aural  coupling  with  binaural  pickups  has  been 
improved  by  the  awareness  of  the  importance  of  acoustic  isolation 
and  by  the  choice  of  proper  material  out  of  which  ears  are  cast. 

Figure  2.  3-4  is  a  section  drawing  of  the  ear-microphone 
assembly  and  the  means  used  for  acoustic  isolation. 


Aluminum  Case 


Figure  2.3-4  Far-Microphone  Assembly 
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From  previous  examination  of  the  ultrasonic  range  20  KC  to 
40  KC  it  appeared  that  the  acoustic  coloration  of  the  material  from 
which  the  pinnae  are  constructed  could  be  of  importance.  This  has 
been  found  to  be  true  and  the  coloration  of  silicone  rubber  from  which 
the  first  ear  replicas  were  made  is  much  too  red  for  best  use.  Pinnae 
were  cast  from  different  epoxies  and  the  coloration  measured. 

Figures  2.3-5  and  2.3-6  illustrate  the  different  colorations  of  each 
material  and  permits  a  comparison  with  a  human  ear,  the  data  for  which 
was  measured  by  inserting  a  Bruel  &  Kjaer  Model  4135  (1/4"  diameter) 
microphone  cartridge  into  the  subject's  ear  canal  and  locating  his  ear 
in  the  same  position  as  the  plastic  ears  by  means  oi  a  jig  made  for  this 
purpose.  From  a  comparison  of  the  results,  steel  filled  epoxy  approaches 
a  human  ear  best.  More  work  is  necessary  in  this  area  before  the 
optimum  material  is  found.  It  will  be  shown  in  a  later  section  of  this 
report  that  the  transformation  provided  by  the  pinna  may  be  expressed 
as  the  Laplace  transform  of  the  delay-dependent  reflection  coefficients. 
The  significance  of  material  selection  is  now  known. 

2, 4  Headphones 

We  have  long  been  aware  of  the  limitations  of  available  head¬ 
phones  for  use  In  localization  work.  The  requirements  of  flat  frequency 
response  over  a  broad  bandwidth  in  an  insertion  type  headphone  cannot 
be  met  by  any  available  product.  The  extent  of  the  limitations  were 
not  known  however.  Therefore,  the  frequency  response  of  a  headphone- 
microphone  system  using  the  mannequin  head,  John  II,  as  part  of  the 
equipment  was  determined.  Various  headphones  were  placed  on  the 
head  and  driven  by  an  oscillator.  The  output  of  the  microphone  behind 
the  ear  was  recorded.  The  results  are  shown  in  Figures  2.4-1  and  2.4-2. 
The  poor  headphone-microphone  response  substantiated  the  need  for 
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Figure  2 . 3-6  Acoustic  Coloration 


Beyer  DT-508  semi-insertion  headphone  measured 
on  mannequin  head  equipped  with  AKG  CK-26 
cartridge  and  CK-60  follower 
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Figure  2.4-1  Insertion  Headphone  Response 
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Figure  2.4-2  External  Headphone  Response 
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extensive  work  in  this  area  if  established  requirements  were  to  be 
fulfilled. 

The  block  diagram  of  Figure  2.4-3  illustrates  a  single  channel 
of  a  binaural  localization  system. 


Ear  Mike  Electronics  Headphone 

Figure  2.4-3  Single  Channel  of  Localization  System 


The  output  signal  Sq  is  the  transformed  input  signal  and  may 
be  written: 

IVmWhI  si  '  so 

Tg  —  transformation  function  of  the  ear 

—  transformation  function  of  microphone 

Tg^  “  transformation  function  of  electronics 

Tjj  —  transformation  function  of  headphone 
For  localization 


that  is,  the  transformation  introduced  by  the  pinna  should  be  faithfully 
reproduced  by  the  Intervening  components.  Therefore 

s.  ‘  h 

or 

T-.T_.-T_.  =  1  over  the  bandwidth  from 

"'*■  1  40  -  40 ,000  cycles  per  second 
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The  fidelity  of  electronic  equipment  is  readily  achieved. 

Therefore  T_t  can  be  made  unity.  However,  the  combined  effects  of 

t  i-i 

the  headphone  and  microphone  create  certain  difficulties.  Fortunately, 
the  state  of  the  art  of  microphone  design  and  manufacture  has 
progressed  rapidly  within  the  past  few  years.  There  are  currently 
available  microphones  with  a  bandwidth  extending  from  20  cycles 
per  second  to  80,000  cycles  per  second  within  +  3  manufactured  by 
Bruel  &  Kjaer  of  Denmark.  However,  the  state  of  the  art  in  head¬ 
phone  development  Is  not  as  far  advanced.  Headphones  specified 
from  15  cycles  per  second  to  15,000  cycles  per  second  are  found 
to  have  many  sharp  resonances  within  this  bandwidth.  Those  resonances 
effectively  distort  the  delays  Introduced  by  the  ear.  Two  major 
directions  of  headphone  development  were  therefore  pursued: 

(1)  diaphragm  magnetic  and  (2)  capacitor. 


The  diaphragm  magnetic  headphone  with  the  conductors  integral 


wit 


/nodmon+a  1 
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model  had  output  sound  pressure  level  of  85  db  with  250  mw  power  input 
and  a  frequency  response  between  100  cps  and  35  kcps  measured 
against  a  Bruel  &  Kjaer  4133  microphone,  DC  impedance  was  1.5  ohms. 
However,  the  2  mil  thick  bakeiite  diaphragm  made  by  Baldwin  Lima 
Hamilton  Co.  showed  many  resonances  between  5  kc  and  25  kc. 

Figure  2.4-5  shows  the  test  results. 

The  difficulties  offered  by  this  design  included  (1)  choice  of 
diaphragm  material,  (2)  attachment  of  conductors  to  the  diaphragm, 

(3)  mounting  of  the  diaphraqm  on  the  magnet  to  improve  magnetic 
coupling  and  increase  sensitivity.  While  none  of  these  appears 
Insurmountable,  practically  they  are  difficult  to  attain  without  affecting 
sensitivity  frequency  response  or  size. 

Headphone  elements  made  by  NOTS  personnel  In  which  the 
conductors  were  printed  on  the  diaphragm  were  tested  and  showed  the 
same  features  as  earlier  models.  Further  development  of  this  principle 
was  terminated  In  favor  of  an  insertion  condenser  element.  Excepting 
for  the  low  sensitivity  of  the  magnetic  headphone,  it  was  eminently 
suitable  to  experimental  needs. 

The  first  condenser  headphones  employed  Bruel  &  Kjaer  Model 
4135  1/4"  diameter  microphone  cartridges.  These  had  limited  dynamic 
range  and  low  sensitivity.  However,  the  response  characteristic  was 
better  than  any  headphone  element  tried  and  provided  the  first  unequiv¬ 
ocal  transfer  of  the  localization  function  by  aural  coupling.  Since 
these  cartridges  are  expensive  and  limited,  as  noted,  a  capacitor 
headphone  using  .15  mil  aluminized  mylar  was  developed  which  provides 
greater  sensitivity  and  wider  dynamic  range  at  considerably  less  cost. 
Figure  2.4-6  shows  a  binaural  condenser  headset. 
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In  order  to  drive  the  condenser  headset  a  high  gain  binaural 
amplifier,  Figure  2.4-7, was  designed  to  provide  +300  volts  DC 
polarization  and  a  maximum  undistorted  signal  of  84  volts  RMS. 

The  specifications  for  this  unit  are: 

Gain:  54,000:1 

Freq.  Response:  12  cps  -  74  kcps  +  1  db 

Noise:  Less  than  1  /uvolt  RMS  referred  to 

input 

Hum:  16  ju  volts  RMS  referred  to  input 

Max.  Undistorted  Output:  84  volts  RMS 

Harmonic  Distortion:  40  volts  RMS  -  3%  2nd  harmonic 

20  volts  RMS  -  1.5%  2nd  harmonic 
10  volts  RMS  -  1%  2nd  harmonic 
Polarization  Vo'tage:  +  300  volts  DC 

Acoustic  evaluation  of  headphones  in  closed  cavities  is  complicated 
by  cavity  resonances  for  frequencies  above  about  14  kcps.  As  a  result, 
a  simple  device  was  constructed  to  measure  diaphragm  displacement 
of  a  condenser  headphone  In  order  to  infer  the  acoustic  pressure  response 
and  the  effects  of  loading  as  when  coupled  into  the  ear  canal.  Operation 
was  based  on  varying  the  capacity  of  a  circuit  tuned  to  21.4  me.  The 
output  is  a  voltage  linearly  proportional  to  the  displacement  of  the 
diaphragm  over  the  range  of  interest.  Figure  2.4-8  is  the  response 
curve  of  a  condenser  element  found  in  this  manner. 

2 . 5  Evaluation  of  Aural  Coupling 

2.5.1  Localization  Tests 

Earlier  work  established  the  accuracy  of  human  localization  of 
sounds  in  a  real  environment,  both  binaurally  and  monaurally  (Ref.  2). 
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Figure  2.  4-7  Condenser  Headphone  Driver 
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Figure  2.4-8  Condenser  Headphone  Diaphragm  Response 
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In  order  to  adequately  evaluate  the  performance  of  any  underwater  sound 
localization  system,  it  is  necessary  to  know  the  accuracy  achievable 
when  a  listener  is  coupled  to  another  environment  by  means  of  the 
John  II,  or  air  system.  From  the  standpoint  of  operational  utility  it  is 
also  desirable  to  know  the  learning  time  required  before  optimum  accuracy 
is  realized.  A  test  was  designed  to  determine  these  factors. 

Eight  3"  speakers  were  equally  spaced  on  a  12-foot  diameter 
ring  and  the  mannequin  head  located  at  the  center  atop  a  tripod  facing 
one  of  the  speakers,  Figure  2.5-1.  The  head  was  six  feet  above  the 
floor  with  the  speaker  ring  positioned  at  ear  level.  The  subsequent  tests 
involved  azimuth  only.  A  control  box  with  eight  switches  was  provided 
to  permit  selection  of  any  one  speaker  by  the  testor,  The  subject  who 
was  in  a  separate  room  had  a  similar  panel  on  which  eiyht  switches 
were  placed  corresponding  to  the  speaker  -  ring  layout,  Figure  2,  5-2. 
When  a  speaker  was  energized  by  the  testor,  the  subject  pressed  the 
button  corresponding  to  the  position  from  which  the  sound  appeared  to 
originate.  For  a  correct  selection,  a  green  light  flashed  on  both  the 
subject's  panel  and  the  testers  panel.  If  the  initial  choice  was  incorrect, 
a  red  light  flashed  and  another  selection  was  made.  This  was  repeated 
until  the  correct  choice  was  made.  The  data  recorded  were: 

1.  position  of  speaker  selected  relative  to  head;  no.  1  - 
front;  no.  2  -  45°  front  right;  no.  3  -  right  side 

2.  the  number  of  incorrect  choices  made  before  the  correct 
position  was  indicated 

3.  the  total  time  between  selection  of  the  speaker  and  the 
subject's  correct  answer.  The  time  was  measured  by 
producing  a  start  pulse  for  a  Beckman  Universal  Eput  and 
Timer,  Model  8370,  when  a  speaker  was  energized  and  a 
stop  pulse  when  the  correct  position  was  indicated. 
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Figure  2.  5-1  Localization  Test  Set-Up 
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The  actual  positions  of  each  incorrect  answer  were  not  recorded. 
However,  observation  of  subjects  revealed  that  any  incorrect  choice 
was  seldom  more  than  one  position  off.  The  exception  was  in  front-back 
positions  where  the  ambiguity  was  not  so  readily  resolved  by  most 
subjects . 

A  random  noise  generator  was  used  to  excite  the  speakers.  A 
single  test  consisted  of  selecting  each  speaker  six  times  at  random. 

This  test  was  repeated  six  times  to  complete  one  subject's  series. 

After  the  third  test  the  head  was  reoriented  within  the  ring  in  order  to 
minimize  locating  certain  speakers  by  identifiable  characteristics  caused 
by  reverberations.  The  buttom  switches  on  the  subject  panel  were 
similarly  turned  by  relative  rotation  of  the  two  halves  of  an  eight  pin 
connector  so  that  each  button  remained  in  the  same  relative  position 
with  respect  to  the  head  throughout  the  series.  The  subject  was  unaware 
of  the  change.  An  initial  calibration  test  in  which  the  subject  stood  at 
the  center  of  the  ring  was  also  conducted  to  determine  any  localization 
peculiarities  and  to  permit  an  evaluation  of  the  coupling  system.  A 
total  of  eleven  subjects  were  tested. 

2.5.2  Test  Results 

1.  Figure  2.5-3  is  a  graphical  plot  of  the  number  of  mistakes 
made  in  any  one  test  (48  trials)  and  the  number  of  tests. 

2.  Figure  2.5-4  is  a  polar  diagram  of  localization  accuracy  of 
various  individuals.  Figure  2.5-5  is  the  average  accuracy 
of  six  subjects. 

A  learning  time  would  be  indicated  by  a  reduction  in  the  number 
of  incorrect  choices  and  a  corresponding  decrease  in  the  average  time  to 
select  the  correct  position.  However,  such  trends  were  not  found.  The 
conclusions  therefore  arc  that  no  learning  period  is  necessary  or  the 
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tests  were  not  conducted  over  a  sufficient  length  of  time  to  observe  any 
improvement.  The  fact  that  certain  individuals  who  were  directly  involved 
in  this  research  had  high  scores  suggests  other  reasons  for  the  lower 
scores  other  than  a  lack  of  learning.  These  might  include  mental  disposition 
vis-a-vis  the  tests ,  fatigue,  or  disinterest. 


Tebt  Numbs! 

Figure  2.5-3  Learning  Curves  of  Six  Subjects 
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5-4  Azimuth  Localization  Ability  as  a  Function  of  Direction 
Using  Aural  Coupling  -  Results  for  Four  Subjects 
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Figure  Azimuth  Localization  Ability  as  a  Function  of  Direction  Using 

Aural  Coupling  -  Average  for  Six  Subjects 
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CHAPTER  3 

PORPOISE  TRANSLATORS 


3.  1  Equipment 

3.1.1  Bases  for  Design 

The  theoretical  apsects  of  man-to-porpoise  communication  are 
discussed  in  another  section  of  this  report.  Evolving  from  the  theory 
and  from  experimental  work  are  the  bases  for  the  design  of  equipment 
to  facilitate  man--porpoise  communication  experiments: 

1.  Porpoise  whistles  (not  echo  locating  sounds)  range 
from  4  to  18  kc  for  the  most  part. 

2.  The  characterizing  time  interval  for  human  vowel 
sounds  ranges  from  400  to  2600  microseconds. 

It  was  decided  to  limit  the  initial  work  to  translation  of  vowel 
sounds  only.  The  intention  was  to  translate  the  porpoise  whistles  into 
vowel  sounds  which  when  repeated  by  an  operator  would  generate  the 
original  porpoise  whistle.  Two  instruments  were  designed  and  constructed 
for  this  purpose.  The  first  is  termed  the  porpoisc-to-man  translator,  the 
second  the  man-to-porpoise  translator.  Model  I  designs  were  fabricated 
during  the  summer;  these  Instruments  were  tested  (see  other  section) 
and  on  the  basis  of  these  experiments,  Model  II  designs  were  prepared 
and  the  instruments  fabricated  later  in  the  year.  Improvements  in  frequency 
stability,  reduction  of  long-tailed  transients,  gating  and  matching  of  the 
two  units  were  made.  Further  details  are  contained  in  later  paragraphs. 
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These  instruments  are  based  on  preserving  a  relationship  between 
the  porpoise  whistle  frequency  and  the  characteristic  time  interval  of 
the  vowel  sounds;  4  to  18  kc  in  the  porpoise  whistle  corresponding  to 
2600  to  400  microseconds  in  the  vowel  sounds, 

3.1.2  Operating  Principles  of  Model  I  Designs 

1 .  Porpoise-to-Man  Translator 

The  internal  functioning  of  the  Model  I  porpoise-to-man  circuits 
IS  shown  in  block  diagram  form  in  Figure  3.1-1.  Basically,  the  porpoise 


Figure  3.1-1  Functional  Diagram  of  Model  I  Porpoise-to-Man  Translator 
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whistle  signal  is  amplified,  then  fed  to  a  Schmitt  trigger  circuit.  This 
trigger  circuit  produces  an  output:  voltage  of  one  of  two  values  depending 
on  whether  the  signal  is  plus  or  minus.  In  this  way  the  porpoise  signal 
is  changed  to  a  square  wave.  This  square  wave  feeds  a  counting  rate 
circuit  which  produces  a  voltage  proportional  to  the  frequency  of  the 
incoming  square  wave.  This  voltage  in  turn  controls  the  frequency  of 
a  multivibrator  This  multivibrator  is  synchronized  by  the  vocal 
repetition  rate  oscillator.  The  two  waveforms  produced  by  these 
oscillators  are  combined  in  a  mixer  to  produce  the  synthetic  voice 
output  of  the  system.  In  the  actual  circuits  the  direction  of  oscillator 
frequency  change  with  porpoise  wnistle  frequency  is  controllable  and 
the  vocal  repetition  rate  is  varied  slightly  in  accordance  with  porpoise 
whistle  frequency.  The  output  signal  is  gated  off  when  the  porpoise 
whistle  frequency  Is  less  than  3  kc.  Figure  3.1-2  is  the  circuit  diagram. 
The  Instrument  is  housed  In  a  case  similar  to  that  of  the  Model  II. 

Details  of  performance  are  omitted,  there  being  detailed  information  on 
the  Model  II  instrument. 

2 .  Man-to- Porpoise  Translator 

The  internal  functioning  of  the  rr.an-to-pa.nv-j  ro  circuitry  is 
illustrated  in  the  block  diagram  of  Figure  3.1-3.  Voiced  sounds  are 
picked  up  by  the  microphone  then  amplified  and  fed  to  a  Schmitt  trigger 
circuit.  The  trigger  circuit  changes  the  signal  waveform  to  a  square  wave. 
The  output  transitions  occur  at  the  zero  crossings  of  the  input.  The 
period  to  voltage  circuit  produces  an  output  voltage  linearly  related 
to  the  period  between  positive  and  negative  going  transitions  of  the 
square  wave.  The  value  obtained  is  clamped  and  held  until  the  next 
value  is  obtained.  A  measure  is  made  for  each  plus  to  minus  transition. 
The  voltage  obtained  fay  this  process  is  filtered  slightly  and  then  used 


-46- 


NOTS  TP  3X09 
Part  4 


Figure  3.  1-3  Functional  Diagram  of  Model  I  Man-to-Porpoise  Translator 

to  control  the  frequency  of  a  variable  frequency  oscillator.  Output  of 
this  oscillator  mixes  with  the  output  of  a  fixed  frequency  oscillator 
producing  a  beat  frequency  which  varies  from  4  to  18  kc.  This  beat 
frequency  is  filtered  out  from  the  composite  signal  obtained  from  the 
mixer  and  is  the  output  of  the  system.  A  gate  serves  to  shut  off  the 
output  in  the  absence  of  a  voiced  signal.  Figure  3.  1-4  is  the  circuit 
diagram.  This  instrument  is  portable,  being  contained  in  an  instrument 
case  similar  to  the  Model  II  instrument.  Details  of  performance  are 
omitted  so  that  full  information  may  be  presenled  on  the  Model  II 
instruments. 
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3.1-3  Model  II  Instruments  (Figures  3.1-6,  3.1-7) 

1 .  Porpoise-to-Man  Translator 

The  internal  functioning  of  the  Model  II  porpoise-to-man  translator 
is  illustrated  in  Figure  3. 1-5.  The  difference  between  this  and  the 
Model  I  instrument  is  in  the  method  of  deriving  a  voltage  proportional 
to  input  frequency.  In  the  Model  II  circuit  the  number  of  cycles  of 
signal  in  a  12-5  millisecond  period  are  counted,  a  voltage  proportional 


Figure  3.  1-5  Functional  Diagram  of  Model  II  Porpoise-to-Man  Translator 
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Por  pois  e  -  To  -  Man 
Translator 


Man- To- Porpoise 
Ti  dtioi&tor 


Figure  3.  1-7  Internal  Construction 
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to  this  count  is  obtained,  clamped  and  held  until  the  next  measure  is 
obtained.  This  method  of  obtaining  a  voltage  proportional  to  frequency 
eliminates  transients  due  to  the  averaging  circuits  required  in  the 
standard  counting  rate  circuits.  For  sudden  increases  or  decreases  of 
input  frequency,  steady  state  response  occurs  within  25  milliseconds. 

In  other  respects ,  the  block  diagram  is  the  same  as  for  the  Model  I 
circuit.  Figure  3.1-8  is  a  circuit  diagram.  Here  several  significant 
differences  are  apparent:  (1)  the  frequency  to  voltage  circuit  is  much 
more  complex  than  that  previously  used.  (2)  the  interval  oscillator 
is  of  a  different  and  improved  design.  (3)  the  mixer  and  gate  transistor 
is  a  special  purpose  one  and  (4)  gate  control  is  by  a  Schmitt  circuit  to 
provide  a  positive  action. 

Performance  characteristics  are  presented  following  the  discussion 
of  the  man-to- porpoise  translator. 

2.  Man-to-  Porpoise  Translator 

The  Model  II  man-to- porpoise  translator  has  the  same  internal 
functioning  as  the  Model  I  circuit  shown  in  Figure  3.  1-4.  Circuit 
improvements  have  been  made.  These  may  be  seen  in  the  diagram  of 
Figure  3,  1-9  to  be:  (1)  The  period  to  voltage  circuit  has  been  simplified 
and  includes  an  improved  gate  circuit.  (2)  A  special  design  of  buffer 
amplifier  is  used.  (3)  Special  purpose  gate  transistors  are  used. 

(4)  Output  gating  Is  controlled  by  a  Schmitt  trigger  which  assures 
positive  action. 

Performance  characteristics  are  presented  in  the  next  section. 
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3.  1-4  Circuit  Behavior 
1.  Porpoise-to-Man  Translator 

Figure  3. 1-10  illustrates  the  translation  of  4  to  18  kc  sine 
waves  into  synthetic  speech.  Figure  3.  1-11  illustrates  the  reverse 
process  with  sine  wave  input  to  the  man-to -porpoise  translator. 

Figure  3,  1-12  illustrates  the  behavior  when  the  output  of  the  porpoise- 
to-man  translator  was  used  to  feed  the  man-to-porpoise  translator. 

This  test  translates  a  whistle  to  a  synthetic  speech  and  then  from  the 
synthetic  speech  back  to  whistle.  Figure  3.  1-13  Illustrates  the 


reverse  process, 
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Figure  3.1-12  Tracking  Characteristic  of  Translators 
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Figure  3.1-13  Tracking  Characteristic  of  Translators 
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3.  2  Theoretical  Basis  for  Translators 

The  mechanism  of  time  delays  which  has  been  found  to  be  essential 
in  sound  localization  was  extended  to  speech  recognition.  As  reported 
earlier  [Ref.  2]  ,  speech  is  presumed  to  be  formed  by  a  delay  mechanism 
similar  to  that  provided  by  the  pinna  wherein  the  sound  originating  in  the 
larynx  reaches  the  front  of  the  mouth  by  diverse  paths  defined  by  the  tongue .. 
cheeks,  teeth,  lips,  nasal  passages,  etc.  An  oscillographic  study  of 
phoneme  wave  forms  revealed  that  each  phoneme  has  a  characteristic  time 
delay  irrespective  of  the  speaker.  The  delay  times  for  phonemes  "oh"  to 
"ee"  were  estimated  to  be  from  2000  4  sec  to  400  p.sec.  Thus,  in  the 
translation  scheme  used,  it  was  decided  to  process  the  frequency  modulated 
porpoise  whistles  so  that  a  4  kc  frequency  produces  a  2000  4  sec  character¬ 
istic  time  and  an  18  kc  frequency  produces  a  400  j u  sec  characteristic  time 
in  the  synthetic  speech  output  from  the  porpoise-to-man  translator.  Inter¬ 
mediate  frequencies  produce  a  period  inversely  related  to  frequency.  Vocal 
cord  repetition  rate  was  arbitrarily  set  at  approximately  100  pulses  per  second. 

The  reverse  mode,  the  translation  of  voiced  sounds  to  porpoise-like 
whistles,  is  programmed  in  a  similar  way.  A  characteristic  time  of  2000 
4  sec  is  translated  to  a  whistle  frequency  of  4  kc  and  a  400  u  sec 
characteristic  time  is  translated  to  an  18  kc  frequency.  Translation  of 
intermediate  frequencies  is  on  a  frequency  proportional  to  inverse 
period  basis. 

3 . 3  Test  of  Translators 

The  experimental  work  with  porpoises  using  the  translators  was 
most  gratifying.  The  tests  were  conducted  at  the  U.  S.  Naval  Ordnance 
Test  Station  facility  at  Point  Mugu, 
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A  set  of  words  was  selected  for  use  in  testing  mimicry  with  the 
porpoise.  The  words  used  were  "ball/'  "ezra,"  "peterson,"  and  "focina." 
Of  these  only  "ball"  was  reproduced  accurately  by  the  porpoise -to-man 
translator.  Dependence  on  simple  recognition  allowed  the  remaining  words 
to  be  used.  A  half-four  tape  recording  of  porpoise  sounds  was  made  for 
reference.  Then  the  word  "bali'1  was  spoken  into  the  man-porpoise 
translator.  Silence  ensued,  but  after  three  repetitions  of  the  word, 
the  porpoise  vocalized  a  similar  sound  as  translated  by  the  porpoise- 
to-man  unit.  Several  minutes  of  this  exchange  followed,  with  reinforcement 
by  feeding  mackerel,  until  it  seemed  that  mimicry  was  present.  The 
selection  of  vocabulary  by  Lite  puipoise  during  this  time  was  limited  almost 
entirely  to  "ball" -like  words.  The  same  process  was  repeated  with  "ezra" 
with  apparent  mimicry  by  the  porpoise ,  but  with  more  frequent  use  of  his 
standard  vocabulary.  The  word  “peterson”  followed. 

Upon  the  Introduction  of  "peterson,"  still  more  of  the  standard 
vocabulary  was  used  by  the  animal,  and  mimicry  did  not  seem  to  occur. 
However,  an  interesting  thing  did  occur.  When  silence  reigned, 

"peterson."  was  spoken  into  the  translator  and  the  porpoise  responded 
immediately  with  “why-oh."  This  was  repeated  at  least  ten  times.  It 
was  concluded  that  "peterson"  would  not  be  repeated.  The  same  reaction 
occurred  with  "focina." 

The  experiment  was  ended  at  this  point  with  reservation.  Respon¬ 
siveness  of  the  porpoise  was  certain  whereas  mimicry  was  somewhat  less 
certain.  Communication  was  definitely  present  but  the  game  suffered  from 
lack  of  meaning  for  the  porpoise. 

Perhaps  the  most  important  result  was  the  knowledge  that  the  design 
and  conduct  of  future  experiments  must  include  consideration  of  the  animals’ 
reactions  to  intrusions  on  his  environment.  Meaningful  results  can  be 
expected  only  when  the  test  is  designed  so  as  to  insure  that  the  porpoise 
retains  complete  control  of  his  environment  even  while  responding  to  outside 
influences. 
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CHAPTER  4 

THEORY 


4 . 1  Introduction 

At  this  reporting  point  it  seems  desirable  to  review  the  study  of  the 
role  of  the  pinna  in  human  localization  in  order  to  make  a  coherent  whole  of 
what  has  br'en  done.  Although  the  study  of  this  particular  topic  began  with 
the  demonstration  that  distorting  the  pinnae  distorts  the  perception  of  locale , 
these  are  several  background  ideas  which  are  essential  to  the  formation  of 
hypotheses  and  theories  regarding  the  function. 

The  first  background  theorem  concerns  the  number  of  independent 
indices  necessary  to  define  a  point  and  the  consequent  requirements 
regarding  transformations  of  a  space  of  points.  If  the  point  is  three- 
dimensional,  requiring  three  indices  to  define,  then  not  less  than  three 
indices  must  be  preserved  in  transformation  if  the  character  of  the  space  is 
to  be  preserved.  If  a  sound  source  is  located  in  a  room,  then  a  coordinate 
system  of  referrent  and  three  referreds  is  necessary  to  specify  its  location. 

If  the  information  space  is  acoustical,  then  there  must  be  a  referrent 
and  three  referred  acoustical  features  to  the  point  in  order  to  make  a  cor¬ 
respondence  in  location  between  the  physical  space  and  the  acoustical 
space.  The  sound  from  the  sound  source  musl  therefore  be  complex,  con¬ 
sisting  of  at  least  four  sine  wave  components  of  differing  frequencies.  This 
requirement  is  met  by  transient  sounds,  usually  consisting  of  a  broad  spectrum 
of  components ,  more  than  sufficient  to  form  a  basis  for  measurement  at  a  point. 

There  may  be  a  joint  arrangement  of  physical  and  acoustical  features 
in  the  measurement.  If  measurement  is  made  at  two  points  in  the  physical 
space,  the  requirement  on  the  acoustical  space  is  reduced  by  one  dimension, 
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and  so  on.  If  measurement  is  made  at  a  coordinate  set  of  points  in  physical 
space  .  the  sound  source  may  consist  of  a  single  sine  wave. 

The  second  background  theorem  concerns  the  transfonnation  inverse 
to  the  one  of  perception.  The  initial  or  perceptual  transformation  forms  a 
correspondence  between  the  source  and  the  observer,  it  is  then  necessary 
to  form  a  correspondence  between  the  observer  and  the  source  which  properly 
identifies  the  source  coordinates.  If  the  inverse  transformation  cannot  be 
constructed,  then  the  location  of  the  source  remains  unknown.  In  general, 
only  approximate  or  probabilistic  inverse  transformations  are  constructable. 
This  may  be  considered  as  correspondence  in  measured  subsets  of  the  point 
sets  concerned. 

4 . 2  Transformation  by  the  Pinna 

Given  the  background  requirements ,  the  pinna  must  introduce  a 
characterizing  transformation  between  the  sound  source  and  the  eardrum 
which  preserves  the  dimensionality  pertinent  and  for  which  an  inverse  can 
be  constructed.  Our  early  observations  showed  that  the  pinna  introduced 
a  number  of  paths  of  different  lengths  between  the  sound  source  and  the 
eardrum.  The  transfonnation  thus  defined  showed  variation  with  azimuth 
angle,  altitude  angle,  and  range  such  that  the  delay  lengths  for  azimuth 
angle  and  altitude  angle  could  be  measured,  but  the  range  Information  was 
less  obvious  and  remains  to  be  well  specified. 

In  order  to  understand  the  apparent  character  of  the  transformation, 
we  Introduced  simplified  models.  Of  these,  the  first  was  the  "two-hole 
coupler'1  shown  in  Figure  4.2-1.  This  model  was  patterned  after  the 
"directional  coupler"  familiar  to  users  of  wave  guides,  but  has  a  range  of 
angles  between  0-18Cc  instead  of  only  the  two  limit  points. 
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Figure  4.2-1  Two  Hole  Coupler 


If  a  sound  pulse  Is  propagated  at  the  angles  Indicated,  the  resultant 
output  from  the  microphone  (neglecting  reverberation)  will  be  as  shown  in 
Figure  4, 2-2. 


Figure  4.2-2  Signals  from  the  Two  Hole  Coupler 


6 
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A  measurement  of  the  time  between  pulses  would  provide  the 
information  from  which  the  angle  of  arrival  could  be  computed.  The  time 
between  pulses  at  the  microphone  follows  equation  (4,2-1) 


T  = 

~  (1  +  cos  A) 

V 

(4.2-1) 

T  = 

time  between  pulses 

(4.2-2) 

s  - 

distance  between  holes 

(4.2-3) 

V  = 

velocity  of  sound 

(4.2-4) 

A  = 

angle  of  normal  to  wave  front 
with  hole  line 

(4.2-5) 

Following  the  two  hole  coupler,  a  “three  hole  coupler"was  examined 
as  a  model.  The  three  hole  coupler  may  be  considered  as  a  coordinate  pair 
of  two  hole  couplers  with  a  common  origin  or  referrent.  A  three  hole  coupler 
is  shown  in  Figure  4.2-3. 


Figure  4.2-3  Three  Hole  Coupler 
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In  order  for  the  inverse  transformation  to  provide  a  unique  correspondence 
between  pulse  time  separation  and  azimuth  angle  and  altitude  angle  ,  it  is 
necessary  to  separate  the  domains  by  a  fixed  delay  path  as  shown  in  the 
figure.  The  equations  pertinent  are  as  follows: 


T1 

= 

se 

(1  +  cos  0 ) 

V 

(4.2-6) 

t2 

= 

kSe  ST 

- + - (1  +  COS  'J') 

V  V 

(4.2-7) 

T1 

= 

delay  corresponding  to  6 

(4.2-8) 

T  2 

= 

delay  corresponding  to 

(4.2-9) 

se 

= 

distance  between  holes  on  0  line 

(4  2-10) 

s* 

= 

distance  between  holes  on  &  lino 

(4.2-11) 

K  > 

2 

domain  separating  delay  path 

These  models  provided  insight  into  possible  manners  of  transformations 
defining  angles  ,  but  did  not  provide  for  range  information.  There  are  a  number 
of  possibilities  as  yet  unsettled  regarding  the  inferrence  of  range.  In  examining 
the  requirements  for  acoustic  transformation  to  provide  localization,  a  geometric 
fojni  was  evolved  which  is  similai  to  an  ear.  For  localization  in  azimuth,  the 
transforming  mechanism  should  be  altitude  independent  and  function  over 
180  degrees.  Figure  4.2-4  was  sketched.  The  reflector-diffractor  was  made 
half-round  to  provide  180  degree  effects,  whereas  a  full  round  would  be 
ambiguous  because  of  symmetry. 

A  similar  structure  would  provide  altitude  angle  information,  but  must 
be  displaced.  The  focal  point  could  be  ducted  to  the  microphone  to  produce 
the  geometry  sketched  in  Figure  4.2-5. 
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Figure  4.2-4  A  Diffration  System  Providing  Azimuth  Delays 


Microphone 


Altitude  Diilraciur 


Azimuth  Diffractor 


Figure  4.2-5  A  Diffractor  System  Providing  Delays  in  Azimuth  and  Altitude 


p 
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It  is  evident  that  only  the  curvature  of  the  incident  wave  front  can 
provide  range  information  to  the  transformation  being  considered,  so  that 
such  measurement  will  be  poorly  resolved  at  best.  A  second  diffractor, 

Figure  4.2-6,  would  provide  curvature  information  by  small  delay  differences, 

shown  by  double  pulses  in  Figure  4.2-7.  Note  that  the  coordinates  may 

also  be  skewed  rather  than  orthogonal,  suggesting  the  geometry  of  the  human  ear. 

The  pertinent  equations  are  formed  by  two  sets  of  three  hole  coupler 
equations . 

4 . 3  Inverses  to  the  Pinna  Transformation 

Assuming  that  the  transformations  described  are  pertinent,  as  supported 
by  observation,  the  means  of  forming  the  inverse  correspondence  becomes 
the  next  question.  This  was  approached  by  attempting  mathematical  formu¬ 
lations  of  the  inverse  transform.  The  inverse  first  constructed  is  essentially 
that  for  the  two  hole  coupler,  and  can  be  done  in  several  wayrs.  The  first  way 
is  as  follows: 

Given  a  signal  after  passing  through  a  delay 

f (t)  +  af  (t  —  t) 
f(t)  =  source  sound 
r  =  time  delay  in  reflection  path 
a  =  attenuation  factor  at  path  reflection 

Construct  the  series 
M 

^  (-l)n  aR  [f(t  -  nr)  +  af  (t  -  nr  -  r)] 

n  =  0 

M  M 

=  Y  (-l)n  an  f(t  -  nr)  +  V  (_l)n  an+  1  f(t  -  r  -  nr) 

L-t  Lj 

n  =  0  n  =  0 
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Figure  4.2-6  A  Diffraction  System  Providing  Delays  in  Azimuth, 
Altitude ,  and  Range 


Figure  4.2-7  Hypothetical  Presence  of  Range 
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M 


=  f(t)  +  ^  (-l)n  a11  f(t  -  nr) 

n  =  1 
M  +  1 

+  y  (-l)”"1  an  f(t  -  nr) 


n=  1 
M 


M 


=  £(t)  +  y  (-l)n  a'1  f(t  -  tit)  -  y  (-l)nan  f(t  -  nr) 


n  =  1 


,  .,M  +  1  M  +  1 
+  {-!)  a 


n  =  1 

f(t  -  T  -  Mt) 


,,,,  t  +  1  M  +  1  ..  > 

f(t)  +  (-1)  a  f(t  -  r  -  Mr) 


=  h(t) 

11m  h(t)  =  f(t) 
M-*-oo 


a  <  1 


Although  the  first  method  produces  a  satisfactory  Inverse,  there  Is  a 
second  way  which  converges  more  rapidly. 

Let  P  (t)  =  f(t)  +  af  (t  -  r)  be  the  signal  after  delay  and  addition. 
Then  construct  a  series 

„n-2  n-2  „ 

Pn(t)  =  Pn  l(t)  +  (-ir  ?n_  ^  (t  -  2n  t) 


2  <  n  <  « 

And  by  examination,  we  find  that  a  consequence  is  that 

9n~2  9 

P  =  fit)  -  a  f(t  -  r) 

n 
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Using  the  rule  of  construction  and  P  we  find 

n 


And 


n+  1 


,n-2 

f(t)  -  a  f(t  -  2  t) 


„n-2  n-2 

+  a“  (-1)  f(t  -  2n  t) 

.n-2  „n-2  .n-2  .  . 

2  2  /  . t2  i  n  2  2  ^ 

-a  a  (-1)  f  (t  —  2  t  —  2  r) 

.n-2  .  , 

=  f(t)  -  a  f(t  -  2“_Z  r) 

+  a2  f(t  —  2  :2  r) 


,n-2  „n-2 


-  a  a 

.n-2  2 

(a‘  )  =  a' 


f(t  —  2n"2  r-  2n  2  r) 


,n-l 


and 


.n-2  .  .n-2  „n- 1 

2  r+2  t  =  2  r 


n+l 


9n-l 

f(t)  -  f(t  -  2n  1  t) 


By  finite  induction,  starting  with  P.,  the  expression  of  P  ,  P  ,  ,  in  terms  of 

z  n  n+ 1 

f(t)  etc.  is  correct. 

Of  course 

lira  P  =  f(t)  a  <  1 

n 


The  first  mathematical  model  can  be  realized  by  an  arrangement  of 
delays,  attenuations,  and  signed  additions  according  to  the  sketch  of 
Figure  4.3-1. 
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Figure  4.3-1  Construction  of  Series  Inverse  at  Basilar  Membrane 

The  second  mathematical  model  can  also  be  realized  by  an  arrangement 
of  delays,  attenuations,  and  signed  additions  according  to  the  sketch  of 
Figure  4.3-2. 
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If  we  introduce  the  Laplace  transform,  the  mathematical  expressions 
can  be  greatly  simplified,  although  the  details  of  construction  are  not  evident 
as  in  the  time  models. 


A  delay  is  expressed  as 


The  transformed  signal  of  the  first  model  is  thus 


F(s)  (1  +  ae  ST)  =  H(s) 
From  which  the  inverse  is  obviously 

..-His).  — 


F(s) 


-sr 


1  +  ae 

Using  the  Laplace  transform  permits  the  expression  of  the  more  complex  system 
of  a  spanning  transformation. 

The  initial  signal  in  this  representation  is 

-sr  -sr„  -sr„ 

H(s)  =  F(s)(l  +3^  +  a2f  +  ) 

for  a  spanning  set.  However,  we  might  generalize 

N 


H (s)  =  F(s)  2  ake 
k  =0 


-sr 


The  inverse  for  this  in  transform  representation  becomes 

I  h 


F(s)  =  H(s)  ^  (-1)* 


h  =  0 


N 


1  ak' 


"ST, 


k  =  1 


The  time  domain  realization  of  F(s)  is  then  a  series  of  delays  with  particular 

attenuation  factors.  Obviously  an  approximation 

Ih 


II 

F(s)  =  H(s)  £  (-l)h 
h  =  0 


N 

2  V 

k  =  1 


J 


a.  *  1 
u 

T0  -  0 


converges  when  the  reflection  factors  are  less  than  unity. 
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The  inverse  can  be  constructed  as  shown  by  the  equation  as  continued 
networks  of  networks  similar  to  the  one  shown  in  Figure  4.3-3. 


Figure  4.3-3  Series  Inverse  for  Several  Delays 

However,  a  third  mathematical  model  is  possible  using  a  feedback 
system  as  shown  in  Figure  4.3-4. 


Figure  4 . 3-4  Feedback  System  for  Inverting  a  Set  of  Three  Delays 
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The  mathematics  are  as  follows  for  the  two  hole  coupler 
H(s)  =  F(s)  (1  +  ae"Sr) 

The  "error  signal"  in  the  feedback  system  at  the  subtractor,  ©  is 

E(s) 

The  error  signal  after  delay  is 

E(s)  s“ST 

The  feedback  equations  are 

E(s)  =  H(s)  -  E(s)  ae  S' 

e(s)  (l  +  at  }  —  H{s) 

E(s)  =  - =  F(s) 

1  +  ae“ST 

And  for  the  more  complex  network  shown: 

E(s)  (1  +  aj6  1  +  a2t  2  +  a^e  3)  =  H(s) 

E{s)  =  F{s) 

4 . 4  Extensions  of  Theory 

With  the  transformations  introduced  by  the  pinna  defined  in  theory 
and  measurement,  and  the  Inverses  constructable  by  any  of  the  models 
given,  it  is  possible  to  extend  the  examination  of  the  function  of  human 
hearing  to  the  treatment  of  reverberation.  One  may  observe  that  the  external 
ear  or  pinna  provides  a  characteristic  reverberation  by  the  introduction  of 
multiple  paths  for  the  sound  to  reach  the  eardrum,  Simiiarly,  a  room  provides 
multiple  paths  between  the  sound  source  and  the  perceiver.  In  theory  then, 
the  significant  difference  between  the  roie  of  the  pinna  and  room  reverberation 
is  one  of  time  domain  and  multiplicity  of  paths  . 
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The  reverberation  of  a  room  can  be  characterized  as  follows: 


F(s)  =  sound  at  source 

R(s)  =  sound  at  percelver  as  modified  by  the  room 

oc 

^  -sr 

R(s)  =  F(s)  >  a  e  n 
U  n 

n  =  0 


If  we  consider  first  the  effect  of  two  parallel  walls  of  infinite  extent 
as  shown  in  Figure  4,4-1,  the  equation  can  be  simplified 


Figure  4.4-1  A  Simple  Reverberant  Situation 

-St.  -st  -st. 

R(s)  =  F(s) [ a„ e  +  a  e  +  a.,c  ] 

The  inverse  to  this  equation  is  simply  constructed  by  the  methods  given, 
thus  removing  the  effect  of  reverberation.  It  would,  however,  be  preferable 
to  use  the  incident  sound  from  the  several  paths  to  improve  the  signal-to- 
nolse  ratio  for  a  particular  condition. 

If  a  finite  set  of  delayed  signals  Is  considered ,  having 

r  =  maximum  delay 
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M 


R(s) 


F(S)  £  an 


p  =  0 


and  the  neural  transformation  Tfs)  constructed 

M  ,  > 

.  -3(r.,— t 

\  M  n 

T(s)  =  >  a  e 

Lj  n 

n  =  0 

the  resultant  mental  signal  P{s)  is  given 


P(s)  =  R(s)  T(s) 
M 

V 


M 


=  F(s)  2,  v  J  1  v 


■8(tM  "  v 


=  c 


k=0 


=  F(s)  e 


M  M 

-ST.,  „  -ST  „  +ST 

M  V  J  \  k 

2  V  2 


v 


i  -  o 


k  -  0 


The  result  Is  the  original  signal  delayed  by  and  modified  by  the  product  of 
sums.  The  product  of  sums  can  be  represented  as  follows: 

M  M  L  M 

"ST.  -v  '  S T ^  ^  2 

;  a.  e  J  }  a,  c  «=  )  a  +  cross  terms 

L  J  L  k  L  n 


j=0 


k  =  0  n  •-  0 

M  M 


cross  terms  =  ^  )  a,  a,  f  "a 

Lj  L 


-ST,  't  S7 

k  ..  j 


VV  '  1  /  " 

k  •=  0  3=0 

In  this  product,  the  cross  term  result  for  j  ^  k  can  be  given  as  follows 


-ST  +  ST 

.  X  X, 

a  (t  +  e  ) 


T  ■ —  T 

k  j 


j  r  k 


a,  a,  =  a.  a, 

k  J  1  k 
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This  form  indicates  a  symmetrical  shift  about  the  maximum  delay  in  the 
general  expression,  and  in  the  frequency  domain  indicates  no  phase  shift 
of  components  but  an  amplitude  change  between  +  a^  and  -a^a^.  Thus 
such  terms  can  be  called  "coloration."  The  general  expression  then 
becomes 

-st. 


M 


M 


P(s)  =  F(s)e  *”  ^  a  “  +  coloration 

n  =  0 

The  perceived  signal  would  then  be  increased  in  amplitude  by 

M 


I 


n 


n  =  0 

and  colored  by  terms  of  the  form 

+  e  +sx) 

none  of  which  could  alter  the  result  by  more  than 


For  example,  let. 


±aJ  \ 


1 

.9 

.8 

.7 

,6 


The  increase  in  amplitude,  I,  is  then 
I  =  3.30 


and  the  maximum  coloration  ,  C_ . 

JVi 


C. .  = 

M 


a,,  a.  =  .9 
u  i 
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The  subjective  result  of  this  method  of  attention  would  be 

a)  Increased  loudness 

b)  Perceived  with  delay 

c)  Colored  somewhat  in  spectrum 

Where  sounds  are  weak,  this  method  can  be  preferred  to  the  direct 
inverses  and  appears  to  be  used,  from  our  subjective  test  regarding  position 
and  coloration. 


It  should  also  be  evident  that  transformations  by  the  pinna  can  be 
treated  in  the  same  wav. 


In  an  examination  of  the  extensions  of  the  theory  it  now  seems  reasonable 
to  state  tentatively  several  domains  in  time  regarding  correlations  and  inverses: 


0  ix  sec  <  <  300  jisec 

0  <  D2  <  000  ii  sec 

600  n  sec  <  Dg  <  2200  /r  sec 

2. 2  ms  <  <  10  ms 

10  ms  <  D.  <  200  ms 
4 


Pinna 

Interaural 

Speech 

Music 

Reverberation 


The  remaining  concerns  for  human  hearing  have  been  briefly  presented 
in  the  previous  final  report ,  It  Is  hoped  that  these  may  all  be  combined  in 
the  near  future  into  a  coherent  statement  regarding  human  hearing,  localization, 
attention  and  recognition  as  well  as  models  of  possible  nerve  and  synoptic 
systems  to  accomplish  the  theoretical  and  experimental  results. 
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CHAPTER  5 

CONCLUSIONS 


5, 1  Hydrophones 

It  is  now  apparent  that  a  hydrophone  suitable  for  underwater  sound 
localization  must  be  designed  as  an  integral  part  of  the  mechanism  which 
introduces  the  localization  transformation.  Implied  is  not  only  the  perform¬ 
ance  specifications  outlined  earlier,  but  also  the  material  and  the  configuration 
of  that  mechanism.  Furthermore,  it  is  now  felt  that  enlarged  ear  replicas  do 
not  provide  the  optimum  mechanism.  While  their  use  allows  subjective 
evaluation  of  remote  environment  localization,  they  do  not  model  accurately 
in  water  as  do  normal  size  ears  In  air.  It  may  be  stated  simply  that  for 
preservation  of  time  delays  an  enlargement  by  the  sonic  velocity  ratio  is 
sufficient.  Plowever,  accurate  acoustic  modeling  requires  enlargement  by 
the  square  of  the  ratio.  Hence,  a  greater  emphasis  will  be  placed  on  the 
synthesis  of  the  ear's  function  by  a  combination  of  hydrophones  and  fixed 
delays. 


S .  2  Headphones 

The  condenser  headphones  described  earlier  adequately  transfer  the 
localization  function.  Nevertheless,  further  improvements,  can  be  made  in 
dynamic  range,  low  end  response,  and  sensitivity;  however,  no  further 
research  is  contemplated  in  this  area.  It.  Is  expected  that  manufacturers 
of  available  products  will  introduce  in  the  near  future ,  headphones  which 
will  hopefully  surpass  the  performance  of  the  condenser  headphones. 
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5 . 3  Effects  of  Structure 

The  reflection  and  diffraction  effects  of  structure  between  the  ears 
has  been  shown  1:o  be  ineffective  in  changing  the  localization  ability  with 
electronic  systems.  In  addition  it  has  been  found  that  the  intoraurai 
distance  may  be  changed  without  affecting  localization. 


S  .4  Theory 

The  theoretical  aspects  of  localization  research  have  provided  a 
deeper  insight  into  the  general  area  of  acoustics.  Some  of  these  ideas 
with  mathematical  support  are  collected  below.  Others  are  included  in 
Reference  3. 


Given  a  signal  modified  by  one  delay  and  attenuation 


H  (s)  =  P(s)  Cl  +  ae-ST) 


An  inverse  can  be  constructed  directly 

.His] 


F(s)  =  — Mi§l —  =  H(s)  V  an 

i  l—1 


-snr 


1  +  ae  n 

n  =  0 

Utilizing  feedback  techniques  to  find  the  inverse  gives 

E(s)  =  H(s)  —  E(s)ae  ST 


E(s 


_H.(s)  _ 

-ST 


F(s) 


1  +  ae 

When  feedback  circuits  are  used,  the  inverse  to  a  multiple  delay  is  simply 

M 


T  S  ST 

F(s)  =  H(s)  )  ane  n  a0 


n  =0 


tq  =  n 


M 


E(s)  =  H(s)  -  E(s)  ^  ane  “ 


n  -- 1 
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When  correlation  and  recognition  are  both  desirable  a  correlation 
function  can  he  constructed.  Given  a  signal  F{s)  multiply  delayed 

M 

^  -sr 

H(s)  =  F(s)  )  a  e  11 
n  ~  0 

construct  the  correlation  function 


M 


T(s»  '  I 


a  e 
n 


-s(t  -  t  ) 
M  n 


n  =  0 


=  maximum  delay  in  original  signal 
then  M 

"  S  T__  1  r> 

H(s)T(s)  =  F(s)e  y  an  +  cc,loration 

n  =  0 

If  the  signal  is  reverberant  on  a  simple  length 

oc 

\  V  n  -snr 
His)  =  FIs)  }  a  r 
Lj 
n~0 

then  the  Inverse,  or  removal  of  reverberation  can  be  done  by 
H(s)I(s)  =  F(s) 

where 

I(s)  =  1  -  ae ST 

If  the  reverberation  includes  a.  sign  change 

cc 

H(s)  =  F(s)  £  (~l)n  ane”snT 
n  =  0 

then 

H(s)  I (s)  =  F(s) 

when 

I(s)  =  1  +  ae  ST 
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If  an  object  has  an  acoustic  shape,  such  as  a  cube,  tetrahedron, 
or  fish ,  the  character  is 

M 

-sr 


H(s)  =  F(s)  ^  a^e  n 


which  may  be  called  "external  reverberation.  "  The  object  Is  recognized  by 


H(s)  T(s)  =  maxi-mum  peak 

If  the  object  is  the  same  shape  but  variant  in  size,  the  scale  factor  k  is 
introduced 

H2(s)  =  F(s) 

n  =  0 

C2(s)  =  CjOcs) 

Thus  the  forms  of  the  recognition  transforms  are  identical  for  the  same  shape. 


M 

I 


-skr 


a  £ 
n 


If  the  object  Is  viewed  at  different  aspects  ,  the  aspect  transformation 
becomes 


M 


H 


>(s)  =  F(s)  I  V 


-sr  -s(cos9  +  *  )  r 


n  n 


n  =  0 


0  =  angle  of  view 

If  the  aspect  transformation  is  multiplied  by  a  rotation  matrix  which  can 
be  inverted  by  a  counter  rotation  matrix,  recognition  is  possible  from  any 
angle.  Thus  size,  shape,  and  aspect  are  all  recognizable.  Distance  does 
not  alter  the  characterizations. 


If  the  object  is  internally  reverberant, 

M  oc 

.  V  V  k 

ms. i  =  ns>  )  /  a 

P  Lj  n 

n  =  0  k=0 


as  a  violin ,  then 

-sk  r 
n 
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and  rotation  corresponds  to  time  relationship  shifts  in  n ,  but  each  string  k 
remains  invariant  with  aspect.  A  violin  can  be  recognized  easily  at  any 
aspect,  but  aspect  colors  the  sound. 

If  reflecting  walls  are  acoustically  colored,  then  the  attenuation 
coefficients  are  functions  of  s 

M  _gT 

H(s)  =  F(s)  £  an(s)e  n 
n  =  0 

The  forms  of  correlations  and  inverses  are  unchanged. 


If  the  delays  are  not  discrete,  but  smeared,  then  the  form  becomes 

00 

H  (s)  =  F{s)  y  a  (t)  e  ST  dr 
0 


=  F(s)  a(s) 


or 


F  (s )  = 


JLM 

a  (s ) 


and  the  time  domain  is  recovered  by 

f(t)  -  L"1  [  F(s)]  =  J*  eST  ds 


which  suggests  the  correlation  transform. 

If  s.peech  is  characterized  by  delays,  the  recognition  transform  T(s) 
docs  not  change  with  differentiation  of  the  signal 

M 

"ST 

sp  H(s)  =  sp  F(s)  )  ane  n 
n  =  0 
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which  is  recognized  by 


T(s) 


M 

V 


■s(tm" 


T  ) 

n 


n=  0 

The  recognition  delays  are  invariant  to  p  and  q„  However,  the  condition 

q  -  p  is  a  coloration  adjustment  which  might  be  made  if  p  were  known. 


In  resume,  it  is  possible  to  localize,  pay  attention,,  recognize  in  noisy, 
reverberant  surroundings,  with  cognizance  of  size  and  aspect,  for  sounds, 
objects,  and  environments,  with  some  freedom  from  particular  spectrum  require¬ 
ments,  by  correlations  for  maximum  peakedness  ,  and  inverses  for  maximum 
bandwidth ,  by  methods  using  delays ,  attenuations  and  signed  additions  ,  which 
appear  to  be  the  counterpart  of  the  human  system  of  hearing.  This  room  sounds 
like)  this  room  whether  from  air  noise,  speech,  traffic,  or  clatter;  a  thing  is 
such  and  such  by  pulse, sweep,  or  any  spanning  stimulus.  A  violin  has  the 
sound  shape  of  a  violin.  Porpoises  recognize  fish,  bats  find  their  food,  people 
listen  to  speech.  This  by  means  simply  conceived,  easily  realizable  in  the 
nervous  system,  but  likely  to  require  large  numbers  of  elements  for  general  use. 
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